Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 4119 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 675.9 KiB |
| Average record size in memory | 168.0 B |
Variable types
| NUM | 10 |
|---|---|
| CAT | 10 |
| BOOL | 1 |
Warnings
| euribor3m is highly correlated with emp.var.rate and 1 other field | High correlation |
|---|---|
| emp.var.rate is highly correlated with euribor3m | High correlation |
| nr.employed is highly correlated with euribor3m | High correlation |
| previous has 3523 (85.5%) zeros | Zeros |
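The percentages in this report appear to be rounded to one decimal place, with very small non-zero shares shown as `< 0.1%`. The sketch below reproduces that arithmetic under this assumed rounding rule, which is inferred from the tables rather than taken from the pandas-profiling source; `format_share` is a hypothetical helper.

```python
def format_share(count: int, total: int) -> str:
    """Format count/total as a percentage the way the tables here appear to."""
    pct = round(100 * count / total, 1)
    # Assumption: a non-zero share that rounds to 0.0 is displayed as "< 0.1%".
    if count > 0 and pct == 0.0:
        return "< 0.1%"
    return f"{pct}%"


N_OBSERVATIONS = 4119  # "Number of observations" from the dataset statistics

print(format_share(3523, N_OBSERVATIONS))  # zeros in `previous`
print(format_share(1, N_OBSERVATIONS))     # a value that occurs exactly once
```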
Reproduction
| Analysis started | 2020-10-02 08:11:38.579598 |
|---|---|
| Analysis finished | 2020-10-02 08:12:04.139149 |
| Duration | 25.56 seconds |
| Software version | pandas-profiling v2.9.0 |
age
Real number (ℝ≥0)
| Distinct | 67 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.11361981 |
|---|---|
| Minimum | 18 |
| Maximum | 88 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 32.2 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 26 |
| Q1 | 32 |
| median | 38 |
| Q3 | 47 |
| 95-th percentile | 58 |
| Maximum | 88 |
| Range | 70 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 10.31336155 |
|---|---|
| Coefficient of variation (CV) | 0.2571037367 |
| Kurtosis | 0.4381297604 |
| Mean | 40.11361981 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.7156939791 |
| Sum | 165228 |
| Variance | 106.3654264 |
| Monotonicity | Not monotonic |
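Several of the descriptive statistics are linked by simple identities. The sketch below reuses only the figures reported for `age` and checks three of them: CV = standard deviation / mean, variance = standard deviation squared, and sum = mean × number of observations. The variable names are illustrative.

```python
import math

# Figures copied from the `age` tables in this report.
n_observations = 4119
mean = 40.11361981
std_dev = 10.31336155

coefficient_of_variation = std_dev / mean  # reported as 0.2571037367
variance = std_dev ** 2                    # reported as 106.3654264
total = mean * n_observations              # reported as 165228

assert math.isclose(coefficient_of_variation, 0.2571037367, rel_tol=1e-6)
assert math.isclose(variance, 106.3654264, rel_tol=1e-6)
assert round(total) == 165228
```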
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 32 | 216 | 5.2% | |
| 31 | 191 | 4.6% | |
| 30 | 177 | 4.3% | |
| 34 | 174 | 4.2% | |
| 35 | 172 | 4.2% | |
| 33 | 170 | 4.1% | |
| 36 | 168 | 4.1% | |
| 38 | 150 | 3.6% | |
| 41 | 147 | 3.6% | |
| 29 | 139 | 3.4% | |
| Other values (57) | 2415 | 58.6% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 18 | 3 | 0.1% | |
| 19 | 1 | < 0.1% | |
| 20 | 4 | 0.1% | |
| 21 | 7 | 0.2% | |
| 22 | 10 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 88 | 1 | < 0.1% | |
| 86 | 2 | < 0.1% | |
| 85 | 1 | < 0.1% | |
| 82 | 2 | < 0.1% | |
| 81 | 3 | 0.1% |
job
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 32.2 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| admin. | 1012 | 24.6% | |
| blue-collar | 884 | 21.5% | |
| technician | 691 | 16.8% | |
| services | 393 | 9.5% | |
| management | 324 | 7.9% | |
| retired | 166 | 4.0% | |
| self-employed | 159 | 3.9% | |
| entrepreneur | 148 | 3.6% | |
| unemployed | 111 | 2.7% | |
| housemaid | 110 | 2.7% | |
| Other values (2) | 121 | 2.9% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 8.992959456 |
| Min length | 6 |
marital
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 32.2 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| married | 2509 | 60.9% | |
| single | 1153 | 28.0% | |
| divorced | 446 | 10.8% | |
| unknown | 11 | 0.3% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.828356397 |
| Min length | 6 |
education
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 32.2 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| university.degree | 1264 | 30.7% | |
| high.school | 921 | 22.4% | |
| basic.9y | 574 | 13.9% | |
| professional.course | 535 | 13.0% | |
| basic.4y | 429 | 10.4% | |
| basic.6y | 228 | 5.5% | |
| unknown | 167 | 4.1% | |
| illiterate | 1 | < 0.1% |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Length
| Max length | 19 |
|---|---|
| Median length | 11 |
| Mean length | 12.82131585 |
| Min length | 7 |
default
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 32.2 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| no | 3315 | 80.5% | |
| unknown | 803 | 19.5% | |
| yes | 1 | < 0.1% |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.974993931 |
| Min length | 2 |
housing
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 32.2 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| yes | 2175 | 52.8% | |
| no | 1839 | 44.6% | |
| unknown | 105 | 2.5% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 2.655498908 |
| Min length | 2 |
loan
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 32.2 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| no | 3349 | 81.3% | |
| yes | 665 | 16.1% | |
| unknown | 105 | 2.5% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.288905074 |
| Min length | 2 |
contact
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 32.2 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| cellular | 2652 | 64.4% | |
| telephone | 1467 | 35.6% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8.356154406 |
| Min length | 8 |
month
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 32.2 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| may | 1378 | 33.5% | |
| jul | 711 | 17.3% | |
| aug | 636 | 15.4% | |
| jun | 530 | 12.9% | |
| nov | 446 | 10.8% | |
| apr | 215 | 5.2% | |
| oct | 69 | 1.7% | |
| sep | 64 | 1.6% | |
| mar | 48 | 1.2% | |
| dec | 22 | 0.5% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
day_of_week
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 32.2 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| thu | 860 | 20.9% | |
| mon | 855 | 20.8% | |
| tue | 841 | 20.4% | |
| wed | 795 | 19.3% | |
| fri | 768 | 18.6% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
duration
Real number (ℝ≥0)
| Distinct | 828 |
|---|---|
| Distinct (%) | 20.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 256.7880554 |
|---|---|
| Minimum | 0 |
| Maximum | 3643 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 32.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 35 |
| Q1 | 103 |
| median | 181 |
| Q3 | 317 |
| 95-th percentile | 740.2 |
| Maximum | 3643 |
| Range | 3643 |
| Interquartile range (IQR) | 214 |
Descriptive statistics
| Standard deviation | 254.7037361 |
|---|---|
| Coefficient of variation (CV) | 0.9918831145 |
| Kurtosis | 20.76192927 |
| Mean | 256.7880554 |
| Median Absolute Deviation (MAD) | 92 |
| Skewness | 3.294781323 |
| Sum | 1057710 |
| Variance | 64873.99319 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 77 | 24 | 0.6% | |
| 112 | 23 | 0.6% | |
| 73 | 22 | 0.5% | |
| 81 | 21 | 0.5% | |
| 90 | 20 | 0.5% | |
| 83 | 20 | 0.5% | |
| 122 | 20 | 0.5% | |
| 145 | 20 | 0.5% | |
| 113 | 20 | 0.5% | |
| 131 | 19 | 0.5% | |
| Other values (818) | 3910 | 94.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 4 | 0.1% | |
| 6 | 5 | 0.1% | |
| 7 | 4 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3643 | 1 | < 0.1% | |
| 3253 | 1 | < 0.1% | |
| 2653 | 1 | < 0.1% | |
| 2301 | 1 | < 0.1% | |
| 1980 | 1 | < 0.1% |
campaign
Real number (ℝ≥0)
| Distinct | 25 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.537266327 |
|---|---|
| Minimum | 1 |
| Maximum | 35 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 32.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 7 |
| Maximum | 35 |
| Range | 34 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.568159238 |
|---|---|
| Coefficient of variation (CV) | 1.012175667 |
| Kurtosis | 25.28452046 |
| Mean | 2.537266327 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 4.003184952 |
| Sum | 10451 |
| Variance | 6.59544187 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1764 | 42.8% | |
| 2 | 1039 | 25.2% | |
| 3 | 549 | 13.3% | |
| 4 | 291 | 7.1% | |
| 5 | 142 | 3.4% | |
| 6 | 99 | 2.4% | |
| 7 | 60 | 1.5% | |
| 8 | 36 | 0.9% | |
| 9 | 32 | 0.8% | |
| 10 | 20 | 0.5% | |
| Other values (15) | 87 | 2.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1764 | 42.8% | |
| 2 | 1039 | 25.2% | |
| 3 | 549 | 13.3% | |
| 4 | 291 | 7.1% | |
| 5 | 142 | 3.4% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 35 | 1 | < 0.1% | |
| 29 | 2 | < 0.1% | |
| 27 | 1 | < 0.1% | |
| 24 | 1 | < 0.1% | |
| 23 | 2 | < 0.1% |
pdays
Real number (ℝ≥0)
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 960.4221899 |
|---|---|
| Minimum | 0 |
| Maximum | 999 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Memory size | 32.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 999 |
| Q1 | 999 |
| median | 999 |
| Q3 | 999 |
| 95-th percentile | 999 |
| Maximum | 999 |
| Range | 999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 191.9227858 |
|---|---|
| Coefficient of variation (CV) | 0.1998316863 |
| Kurtosis | 20.81248388 |
| Mean | 960.4221899 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -4.775139161 |
| Sum | 3955979 |
| Variance | 36834.35571 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 999 | 3959 | 96.1% | |
| 3 | 52 | 1.3% | |
| 6 | 42 | 1.0% | |
| 4 | 14 | 0.3% | |
| 7 | 10 | 0.2% | |
| 10 | 8 | 0.2% | |
| 12 | 5 | 0.1% | |
| 5 | 4 | 0.1% | |
| 2 | 4 | 0.1% | |
| 9 | 3 | 0.1% | |
| Other values (11) | 18 | 0.4% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2 | < 0.1% | |
| 1 | 3 | 0.1% | |
| 2 | 4 | 0.1% | |
| 3 | 52 | 1.3% | |
| 4 | 14 | 0.3% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 999 | 3959 | 96.1% | |
| 21 | 1 | < 0.1% | |
| 19 | 1 | < 0.1% | |
| 18 | 2 | < 0.1% | |
| 17 | 1 | < 0.1% |
previous
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1903374605 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 3523 |
| Zeros (%) | 85.5% |
| Memory size | 32.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.5417883234 |
|---|---|
| Coefficient of variation (CV) | 2.846461868 |
| Kurtosis | 22.12032347 |
| Mean | 0.1903374605 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.022978833 |
| Sum | 784 |
| Variance | 0.2935345874 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3523 | 85.5% | |
| 1 | 475 | 11.5% | |
| 2 | 78 | 1.9% | |
| 3 | 25 | 0.6% | |
| 4 | 14 | 0.3% | |
| 6 | 2 | < 0.1% | |
| 5 | 2 | < 0.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3523 | 85.5% | |
| 1 | 475 | 11.5% | |
| 2 | 78 | 1.9% | |
| 3 | 25 | 0.6% | |
| 4 | 14 | 0.3% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 2 | < 0.1% | |
| 5 | 2 | < 0.1% | |
| 4 | 14 | 0.3% | |
| 3 | 25 | 0.6% | |
| 2 | 78 | 1.9% |
poutcome
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 32.2 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| nonexistent | 3523 | 85.5% | |
| failure | 454 | 11.0% | |
| success | 142 | 3.4% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.42121874 |
| Min length | 7 |
emp.var.rate
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0849720806 |
|---|---|
| Minimum | -3.4 |
| Maximum | 1.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 32.2 KiB |
Quantile statistics
| Minimum | -3.4 |
|---|---|
| 5-th percentile | -2.9 |
| Q1 | -1.8 |
| median | 1.1 |
| Q3 | 1.4 |
| 95-th percentile | 1.4 |
| Maximum | 1.4 |
| Range | 4.8 |
| Interquartile range (IQR) | 3.2 |
Descriptive statistics
| Standard deviation | 1.563114456 |
|---|---|
| Coefficient of variation (CV) | 18.39562413 |
| Kurtosis | -1.041783886 |
| Mean | 0.0849720806 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | -0.7276878782 |
| Sum | 350 |
| Variance | 2.443326802 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.4 | 1626 | 39.5% | |
| -1.8 | 883 | 21.4% | |
| 1.1 | 758 | 18.4% | |
| -0.1 | 392 | 9.5% | |
| -2.9 | 164 | 4.0% | |
| -3.4 | 104 | 2.5% | |
| -1.7 | 87 | 2.1% | |
| -1.1 | 83 | 2.0% | |
| -3 | 21 | 0.5% | |
| -0.2 | 1 | < 0.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| -3.4 | 104 | 2.5% | |
| -3 | 21 | 0.5% | |
| -2.9 | 164 | 4.0% | |
| -1.8 | 883 | 21.4% | |
| -1.7 | 87 | 2.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.4 | 1626 | 39.5% | |
| 1.1 | 758 | 18.4% | |
| -0.1 | 392 | 9.5% | |
| -0.2 | 1 | < 0.1% | |
| -1.1 | 83 | 2.0% |
cons.price.idx
Real number (ℝ≥0)
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 93.5797043 |
|---|---|
| Minimum | 92.201 |
| Maximum | 94.767 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 32.2 KiB |
Quantile statistics
| Minimum | 92.201 |
|---|---|
| 5-th percentile | 92.713 |
| Q1 | 93.075 |
| median | 93.749 |
| Q3 | 93.994 |
| 95-th percentile | 94.465 |
| Maximum | 94.767 |
| Range | 2.566 |
| Interquartile range (IQR) | 0.919 |
Descriptive statistics
| Standard deviation | 0.579348805 |
|---|---|
| Coefficient of variation (CV) | 0.0061909664 |
| Kurtosis | -0.8233578937 |
| Mean | 93.5797043 |
| Median Absolute Deviation (MAD) | 0.38 |
| Skewness | -0.2166414217 |
| Sum | 385454.802 |
| Variance | 0.3356450378 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 93.994 | 758 | 18.4% | |
| 93.918 | 667 | 16.2% | |
| 92.893 | 597 | 14.5% | |
| 93.444 | 528 | 12.8% | |
| 94.465 | 431 | 10.5% | |
| 93.2 | 386 | 9.4% | |
| 93.075 | 201 | 4.9% | |
| 92.201 | 75 | 1.8% | |
| 92.963 | 75 | 1.8% | |
| 92.431 | 43 | 1.0% | |
| Other values (16) | 358 | 8.7% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 92.201 | 75 | 1.8% | |
| 92.379 | 25 | 0.6% | |
| 92.431 | 43 | 1.0% | |
| 92.469 | 14 | 0.3% | |
| 92.649 | 36 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 94.767 | 24 | 0.6% | |
| 94.601 | 20 | 0.5% | |
| 94.465 | 431 | 10.5% | |
| 94.215 | 30 | 0.7% | |
| 94.199 | 39 | 0.9% |
cons.conf.idx
Real number (ℝ)
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -40.49910172 |
|---|---|
| Minimum | -50.8 |
| Maximum | -26.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 32.2 KiB |
Quantile statistics
| Minimum | -50.8 |
|---|---|
| 5-th percentile | -47.1 |
| Q1 | -42.7 |
| median | -41.8 |
| Q3 | -36.4 |
| 95-th percentile | -33.6 |
| Maximum | -26.9 |
| Range | 23.9 |
| Interquartile range (IQR) | 6.3 |
Descriptive statistics
| Standard deviation | 4.594577507 |
|---|---|
| Coefficient of variation (CV) | -0.1134488745 |
| Kurtosis | -0.3143030044 |
| Mean | -40.49910172 |
| Median Absolute Deviation (MAD) | 4.4 |
| Skewness | 0.2873090796 |
| Sum | -166815.8 |
| Variance | 21.11014247 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| -36.4 | 758 | 18.4% | |
| -42.7 | 667 | 16.2% | |
| -46.2 | 597 | 14.5% | |
| -36.1 | 528 | 12.8% | |
| -41.8 | 431 | 10.5% | |
| -42 | 386 | 9.4% | |
| -47.1 | 201 | 4.9% | |
| -31.4 | 75 | 1.8% | |
| -40.8 | 75 | 1.8% | |
| -26.9 | 43 | 1.0% | |
| Other values (16) | 358 | 8.7% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| -50.8 | 24 | 0.6% | |
| -50 | 25 | 0.6% | |
| -49.5 | 20 | 0.5% | |
| -47.1 | 201 | 4.9% | |
| -46.2 | 597 | 14.5% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| -26.9 | 43 | 1.0% | |
| -29.8 | 25 | 0.6% | |
| -30.1 | 36 | 0.9% | |
| -31.4 | 75 | 1.8% | |
| -33 | 21 | 0.5% |
euribor3m
Real number (ℝ≥0)
| Distinct | 234 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.621355669 |
|---|---|
| Minimum | 0.635 |
| Maximum | 5.045 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 32.2 KiB |
Quantile statistics
| Minimum | 0.635 |
|---|---|
| 5-th percentile | 0.8084 |
| Q1 | 1.334 |
| median | 4.857 |
| Q3 | 4.961 |
| 95-th percentile | 4.966 |
| Maximum | 5.045 |
| Range | 4.41 |
| Interquartile range (IQR) | 3.627 |
Descriptive statistics
| Standard deviation | 1.733591223 |
|---|---|
| Coefficient of variation (CV) | 0.4787133276 |
| Kurtosis | -1.396366286 |
| Mean | 3.621355669 |
| Median Absolute Deviation (MAD) | 0.108 |
| Skewness | -0.7150798684 |
| Sum | 14916.364 |
| Variance | 3.005338527 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 4.857 | 274 | 6.7% | |
| 4.963 | 256 | 6.2% | |
| 4.962 | 237 | 5.8% | |
| 4.961 | 212 | 5.1% | |
| 4.856 | 138 | 3.4% | |
| 4.965 | 114 | 2.8% | |
| 4.964 | 110 | 2.7% | |
| 1.405 | 106 | 2.6% | |
| 4.96 | 105 | 2.5% | |
| 4.968 | 101 | 2.5% | |
| Other values (224) | 2466 | 59.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.635 | 3 | 0.1% | |
| 0.636 | 1 | < 0.1% | |
| 0.637 | 1 | < 0.1% | |
| 0.639 | 2 | < 0.1% | |
| 0.64 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 5.045 | 1 | < 0.1% | |
| 4.97 | 21 | 0.5% | |
| 4.968 | 101 | 2.5% | |
| 4.967 | 62 | 1.5% | |
| 4.966 | 72 | 1.7% |
nr.employed
Real number (ℝ≥0)
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5166.481695 |
|---|---|
| Minimum | 4963.6 |
| Maximum | 5228.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 32.2 KiB |
Quantile statistics
| Minimum | 4963.6 |
|---|---|
| 5-th percentile | 5008.7 |
| Q1 | 5099.1 |
| median | 5191 |
| Q3 | 5228.1 |
| 95-th percentile | 5228.1 |
| Maximum | 5228.1 |
| Range | 264.5 |
| Interquartile range (IQR) | 129 |
Descriptive statistics
| Standard deviation | 73.66790356 |
|---|---|
| Coefficient of variation (CV) | 0.01425881439 |
| Kurtosis | 0.0617241978 |
| Mean | 5166.481695 |
| Median Absolute Deviation (MAD) | 37.1 |
| Skewness | -1.075876888 |
| Sum | 21280738.1 |
| Variance | 5426.960015 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 5228.1 | 1626 | 39.5% | |
| 5099.1 | 823 | 20.0% | |
| 5191 | 758 | 18.4% | |
| 5195.8 | 392 | 9.5% | |
| 5076.2 | 164 | 4.0% | |
| 5017.5 | 104 | 2.5% | |
| 4991.6 | 87 | 2.1% | |
| 4963.6 | 83 | 2.0% | |
| 5008.7 | 60 | 1.5% | |
| 5023.5 | 21 | 0.5% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 4963.6 | 83 | 2.0% | |
| 4991.6 | 87 | 2.1% | |
| 5008.7 | 60 | 1.5% | |
| 5017.5 | 104 | 2.5% | |
| 5023.5 | 21 | 0.5% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 5228.1 | 1626 | 39.5% | |
| 5195.8 | 392 | 9.5% | |
| 5191 | 758 | 18.4% | |
| 5176.3 | 1 | < 0.1% | |
| 5099.1 | 823 | 20.0% |
y
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 32.2 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| no | 3668 | 89.1% | |
| yes | 451 | 10.9% |
Pearson's r
Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
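The calculation described above can be sketched in plain Python. The sample values are the `emp.var.rate` and `euribor3m` columns of the ten first rows listed in this report; `pearson_r` is an illustrative helper, not the routine the report generator itself uses.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/(n-1) factors of the covariance and of the two standard deviations
    # cancel, so plain sums of centred products are enough.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


emp_var_rate = [-1.8, 1.1, 1.4, 1.4, -0.1, -1.1, -1.1, -0.1, -0.1, 1.1]
euribor3m = [1.313, 4.855, 4.962, 4.959, 4.191, 0.884, 0.879, 4.191, 4.153, 4.855]

r = pearson_r(emp_var_rate, euribor3m)
# Shifting and rescaling one variable (by a positive factor) leaves r unchanged.
r_rescaled = pearson_r([10 * x + 3 for x in emp_var_rate], euribor3m)
print(round(r, 3), round(r_rescaled, 3))
```

Ten rows are far too few to draw conclusions about the full dataset; the sample only illustrates the mechanics and the location/scale invariance.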
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
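As a rough illustration of the rank-based calculation, the sketch below ranks both variables (ties share the average of their positions, an assumption about tie handling) and then applies Pearson's formula to the ranks. The helper names and the toy data are hypothetical.

```python
from math import exp, sqrt


def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared_rank = (start + end) / 2 + 1
        for position in range(start, end + 1):
            ranks[order[position]] = shared_rank
        start = end + 1
    return ranks


def pearson_r(xs, ys):
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return cov / sqrt(sum((x - mean_x) ** 2 for x in xs) * sum((y - mean_y) ** 2 for y in ys))


def spearman_rho(xs, ys):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


xs = [0.5, 1.0, 2.0, 3.5, 5.0]
ys = [exp(x) for x in xs]  # monotonic but strongly nonlinear
print(spearman_rho(xs, ys), pearson_r(xs, ys))
```

Because the relationship is perfectly monotonic, the ranks of `xs` and `ys` coincide, while Pearson's r on the raw values stays below 1.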
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
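The pair-counting description above corresponds to the tau-a form of the coefficient. A minimal sketch, with `kendall_tau_a` as a hypothetical helper name and tied pairs counted as neither concordant nor discordant:

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(list(zip(xs, ys)), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # direction == 0 means a tie in x or y; such pairs add to neither count.
    n = len(xs)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


print(kendall_tau_a([1, 2, 3], [1, 3, 2]))  # 2 concordant, 1 discordant, 3 pairs
```

Implementations such as SciPy's `kendalltau` default to the tau-b variant, which adjusts the denominator for ties, so results can differ from this sketch when the data contain tied values.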
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | age | job | marital | education | default | housing | loan | contact | month | day_of_week | duration | campaign | pdays | previous | poutcome | emp.var.rate | cons.price.idx | cons.conf.idx | euribor3m | nr.employed | y |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 30 | blue-collar | married | basic.9y | no | yes | no | cellular | may | fri | 487 | 2 | 999 | 0 | nonexistent | -1.8 | 92.893 | -46.2 | 1.313 | 5099.1 | no |
| 1 | 39 | services | single | high.school | no | no | no | telephone | may | fri | 346 | 4 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.855 | 5191.0 | no |
| 2 | 25 | services | married | high.school | no | yes | no | telephone | jun | wed | 227 | 1 | 999 | 0 | nonexistent | 1.4 | 94.465 | -41.8 | 4.962 | 5228.1 | no |
| 3 | 38 | services | married | basic.9y | no | unknown | unknown | telephone | jun | fri | 17 | 3 | 999 | 0 | nonexistent | 1.4 | 94.465 | -41.8 | 4.959 | 5228.1 | no |
| 4 | 47 | admin. | married | university.degree | no | yes | no | cellular | nov | mon | 58 | 1 | 999 | 0 | nonexistent | -0.1 | 93.200 | -42.0 | 4.191 | 5195.8 | no |
| 5 | 32 | services | single | university.degree | no | no | no | cellular | sep | thu | 128 | 3 | 999 | 2 | failure | -1.1 | 94.199 | -37.5 | 0.884 | 4963.6 | no |
| 6 | 32 | admin. | single | university.degree | no | yes | no | cellular | sep | mon | 290 | 4 | 999 | 0 | nonexistent | -1.1 | 94.199 | -37.5 | 0.879 | 4963.6 | no |
| 7 | 41 | entrepreneur | married | university.degree | unknown | yes | no | cellular | nov | mon | 44 | 2 | 999 | 0 | nonexistent | -0.1 | 93.200 | -42.0 | 4.191 | 5195.8 | no |
| 8 | 31 | services | divorced | professional.course | no | no | no | cellular | nov | tue | 68 | 1 | 999 | 1 | failure | -0.1 | 93.200 | -42.0 | 4.153 | 5195.8 | no |
| 9 | 35 | blue-collar | married | basic.9y | unknown | no | no | telephone | may | thu | 170 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.855 | 5191.0 | no |
Last rows
| | age | job | marital | education | default | housing | loan | contact | month | day_of_week | duration | campaign | pdays | previous | poutcome | emp.var.rate | cons.price.idx | cons.conf.idx | euribor3m | nr.employed | y |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4109 | 63 | retired | married | high.school | no | no | no | cellular | oct | wed | 1386 | 1 | 999 | 0 | nonexistent | -3.4 | 92.431 | -26.9 | 0.740 | 5017.5 | no |
| 4110 | 53 | housemaid | divorced | basic.6y | unknown | unknown | unknown | telephone | may | fri | 85 | 2 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.855 | 5191.0 | no |
| 4111 | 30 | technician | married | university.degree | no | no | yes | cellular | jun | fri | 131 | 1 | 999 | 1 | failure | -1.7 | 94.055 | -39.8 | 0.748 | 4991.6 | no |
| 4112 | 31 | technician | single | professional.course | no | yes | no | cellular | nov | thu | 155 | 1 | 999 | 0 | nonexistent | -0.1 | 93.200 | -42.0 | 4.076 | 5195.8 | no |
| 4113 | 31 | admin. | single | university.degree | no | yes | no | cellular | nov | thu | 463 | 1 | 999 | 0 | nonexistent | -0.1 | 93.200 | -42.0 | 4.076 | 5195.8 | no |
| 4114 | 30 | admin. | married | basic.6y | no | yes | yes | cellular | jul | thu | 53 | 1 | 999 | 0 | nonexistent | 1.4 | 93.918 | -42.7 | 4.958 | 5228.1 | no |
| 4115 | 39 | admin. | married | high.school | no | yes | no | telephone | jul | fri | 219 | 1 | 999 | 0 | nonexistent | 1.4 | 93.918 | -42.7 | 4.959 | 5228.1 | no |
| 4116 | 27 | student | single | high.school | no | no | no | cellular | may | mon | 64 | 2 | 999 | 1 | failure | -1.8 | 92.893 | -46.2 | 1.354 | 5099.1 | no |
| 4117 | 58 | admin. | married | high.school | no | no | no | cellular | aug | fri | 528 | 1 | 999 | 0 | nonexistent | 1.4 | 93.444 | -36.1 | 4.966 | 5228.1 | no |
| 4118 | 34 | management | single | high.school | no | yes | no | cellular | nov | wed | 175 | 1 | 999 | 0 | nonexistent | -0.1 | 93.200 | -42.0 | 4.120 | 5195.8 | no |